A Segment-based Dynamic Programing Algorithm for Parsing Gene Structure (Running Head: Segment-based Dynamic Programming)

نویسنده

  • Thomas D. Wu
چکیده

Note: This version is a preliminary draft. Comments and suggestions are welcome. Abstract Predicting gene structure requires search within a combinatorially large space of possible gene structures. The search space may be narrowed by two types of computational tools: optimality criteria and consistency constraints. Consistency constraints are requirements concerning reading frame and stop codons, namely: the total exon length must be a multiple of three; exons may not contain internal stop codons in their reading frame; and exon-exon junctions may not form stop codons in their reading frame. I present a segment-based dynamic programming algorithm that explores the space of globally consistent gene structures, and finds the optimally scoring gene structure within that space. The algorithm may be modified to provide an arbitrary number of near-optimal solutions and to allow cardinality constraints that limit the number of exons in the gene structure. The algorithm maintains reading frame information that may be used to improve scoring estimates of the likelihood of exons. Segment-based dynamic programming has a running time that is expected to be between linear and quadratic with respect to sequence length, depending on the scoring scheme used. I use the algorithm to explore the power of various constraints by comparing their search spaces. The results show that consistency and cardinality constraints reduce the search space by large orders of magnitude.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Open pit limit optimization using dijkstra’s algorithm

In open-pit mine planning, the design of the most profitable ultimate pit limit is a prerequisite to developing a feasible mining sequence. Currently, the design of an ultimate pit is achieved through a computer program in most mining companies. The extraction of minerals in open mining methods needs a lot of capital investment, which may take several decades. Before the extraction, the p...

متن کامل

Identification of High Crash Road Segment using Genetic Algorithm and Dynamic Segmentation

This paper presents an evolutionary algorithm for recognizing high and low crash road segments using Genetic Algorithm as a dynamic segmentation method. Social and economic costs as well as physical and mental injuries make the governments perceiving to road safety indexes in order to diminish the consequences of road accidents. Due to the limitation of budget for safety...

متن کامل

Strategy-aligned fuzzy approach for market segment evaluation and selection: a modular decision support system by dynamic network process (DNP)

In competitive markets, market segmentation is a critical point of business, and it can be used as a generic strategy. In each segment, strategies lead companies to their targets; thus, segment selection and the application of the appropriate strategies over time are very important to achieve successful business. This paper aims to model a strategy-aligned fuzzy approach to market segment ev...

متن کامل

Automatic Prostate Cancer Segmentation Using Kinetic Analysis in Dynamic Contrast-Enhanced MRI

Background: Dynamic contrast enhanced magnetic resonance imaging (DCE-MRI) provides functional information on the microcirculation in tissues by analyzing the enhancement kinetics which can be used as biomarkers for prostate lesions detection and characterization.Objective: The purpose of this study is to investigate spatiotemporal patterns of tumors by extracting semi-quantitative as well as w...

متن کامل

An Efficient Matching Algorithm for Segment-Based Stereo Vision Using Dynamic Programming Technique

An efficient matching method for segment-based stereo vision is proposed. A potential matching graph which describes the connectivity between candidate matching pairs of segments is built. Establishing correspondence is then reduced to a problem of searching for the optimal path that maximizes a similarity measure. The optimal path is found efficiently without the adverse effects of combinatori...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1996